首页> 外文OA文献 >Scalable Multi-Class Gaussian Process Classification via Expectation Propagation
【2h】

Scalable Multi-Class Gaussian Process Classification via Expectation Propagation

机译:基于期望传播的可扩展多类高斯过程分类

摘要

Gaussian processes are non-parametric models that can be used to carryout supervised and unsupervised learning tasks. As they are non-parametricmodels, their complexity grows with the number of data instances, and asa consequence, they can be used to explain complex phenomena associatedwith the training dataset. They are also very useful to introduce a prioriknowledge in the learning problem, because the characteristics that theycan describe are given by a covariance function. Finally, these models areBayesian models, thus they allow to obtain the uncertainty of the predictionsand perform model comparison in an automated way. Despite allthese advantages, in practice Gaussian processes have certain limitations.The first one is that the computations needed to train the model are onlytractable in regression problems with Gaussian additive noise, and for anyother case they need to be approximated. The other problem is their scalability,given that the training cost is cubic with respect to the number of observeddata points N. In this master thesis, we propose a method for multiclassclassification with Gaussian processes that scales well to very largedatasets. For that, it uses the Expectation Propagation algorithm, alongwith the Fully Independent Training Conditional approximation (which introducesM N pseudo-inputs), stochastic gradients and some extra assumptionsthat reduce the training cost to O(M3). Experimental resultsshow that this method is competitive with other approaches based on variationalinference.
机译:高斯过程是非参数模型,可用于执行有监督和无监督的学习任务。由于它们是非参数模型,因此其复杂度会随着数据实例数量的增加而增加,因此,它们可用于解释与训练数据集相关的复杂现象。它们对于在学习问题中引入先验知识也非常有用,因为它们可以描述的特征由协方差函数给出。最后,这些模型是贝叶斯模型,因此它们可以获取预测的不确定性并以自动化方式进行模型比较。尽管具有所有这些优点,但在实践中高斯过程仍然有一定的局限性。第一个是训练模型所需的计算仅在高斯加性噪声的回归问题中是可解决的,并且在任何其他情况下都需要进行近似。另一个问题是它们的可伸缩性,因为训练成本相对于观察到的数据点N的数量是立方的。在本硕士论文中,我们提出了一种利用高斯过程进行多分类的方法,该方法可以很好地扩展到非常大的数据集。为此,它使用了期望传播算法,以及完全独立的训练条件逼近(引入了M N个伪输入),随机梯度和一些额外的假设,这些假设将训练成本降低到O(M3)。实验结果表明,该方法与基于变分推理的其他方法相比具有竞争优势。

著录项

  • 作者

    Villacampa Calvo, Carlos;

  • 作者单位
  • 年度 2017
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号